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ABSTRACT 

We study the topology of cosmic large-scale structure through the genus statistics, using galaxy 
catalogues generated from the Millennium Simulation and observational data from the latest Sloan 
Digital Sky Survey Data Release (SDSS DR7). We introduce a new method for constructing galaxy 
density fields and for measuring the genus statistics of its isodensity surfaces. It is based on a Delaunay 
tessellation field estimation (DTFE) technique that allows the definition of a piece-wise continuous 
density field and the exact computation of the topology of its polygonal isodensity contours, without 
introducing any free numerical parameter. Besides this new approach, we also employ the traditional 
approaches of smoothing the galaxy distribution with a Gaussian of fixed width, or by adaptively 
smoothing with a kernel that encloses a constant number of neighboring galaxies. Our results show 
that the Delaunay-based method extracts the largest amount of topological information. Unlike the 
traditional approach for genus statistics, it is able to discriminate between the different theoretical 
galaxy catalogues analyzed here, both in real space and in redshift space, even though they are 
based on the same underlying simulation model. In particular, the DTFE approach detects with high 
confidence a discrepancy of one of the semi-analytic models studied here compared with the SDSS 
data, while the other models are found to be consistent. 

Subject headings: large-scale structure of universe - methods: statistical - cosmology: observations 



1. INTRODUCTION 

According to the standard cosmological model, the 
large-scale structure in the present Universe has formed 
as a result of gravitational amplification of primordial 
density fluctuations which were presumably seeded by 
quantum fluctuations during the early phases of cosmic 
inflation (i Guthl 119811: iGuth fc Pilll982[ ). The statistical 
properties of the primordial fluctuations are well con- 
strained by cosmic microwave background (CMB) ob- 
servations, as for example measured by the Wilkinson 
Micr owave Anisotropy Probe (WMAP) (jDunklev et al.l 
I2009f ). If the Universe is indeed dominated by cold 
dark matter, the statistics of the initial density field are 
thought to be well preserved in the large-scale distribu- 
tion of galaxies, as mapped, e.g., by the Sloan Digital 
Sky Surve y (SPSS) or the 2d F Galaxy Redshift Survey 
(2dFGRS. ICoUess et alll2001| ). However, it remains an 
important task to test whether the observed galaxy dis- 
tribution is indeed consistent with the expectations for 
the prevailing ACDM cosmology. The latter can be ac- 
curately obtained by evolving the initial conditions con- 
strained by the CM B forward in time w ith the ordinary 
laws of physics (e.g. ISpringel et ani2006( ). 

Most studies of the large-scale distribution of matter 
and galaxies are based on low-order two-point statistics, 
i.e. the two-point correlation function and its Fourier 
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transform, the power spectrum (|Peebleslll98Cl( ). While 
these provide the most basic characterization of the 
statistics of a set of discrete points, they only fully char- 
acterize Gaussian random fields. In order to detect de- 
viations from Gaussianity and the higher-order corre- 
lations that develop as a result of non-linear growth 
of structure even from Gaussian initial conditions, dif- 
ferent statistical measures are required. For example, 
this can be d one in terms of t he one-point density dis- 
tribution (Ko fman et al.l 119941) , or through the three- 
point correlation function and the bispectrum. An in- 
teresting alternative are more direct measures of the 
geometry and topology o f cosmic large-sca le structure, 
such as shape stat istics (iDave et al.l 119971 ) , Minkowski 
functionals ( Mecke et al.l |1994[) . or the genus statistics 
(|Gott et al.l 1198611 . These morphological measures of 
large-scale structure are sensitive to higher-order correla- 
tions and provide important descriptive statistics of the 
cosmic web. 

In this pa per, we focus on the genus statistic first 
proposed by iGott et ahl (|1986f) . It has bee n widely 
applied du ring t he past two d ecades (|G ptt e t al. 1989; 
Park et all 1992; M oore et a l. 1992; Rhoads ct al. 199i 
Voeelev et al.,.1994lCanavezes et al..l998;,Spr ingcl ct aD 
19981: IHikage'eTap 1200^120031: IGott etaim OOS. 200f 
James et al.l 1200 9^. in particular due to its sensitivity 
to non-Gaussianity. The genus measures the topology 
of isodensity surfaces of a smoothed mass density field. 
Therefore, it is sensitive to global aspects of the density 
maps. It can test directly whether the geometry of the 
observed cosmic web is consistent with theoretical expec- 
tations for the ACDM cosmology. 

However, the genus statistics is strongly affected by 
the density reconstruction techniques that are required 
to allow a consideration of isodensity contours in the first 
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place. The most commonly employed technique for this 
purpose uses simple Gaussian smoothing of the point set 
with a fixed kernel size. However, the need to avoid se- 
vere under-sampling in void regions usually forces one to 
significantly over-smooth the strongly clustered regions 
of the cosmic web, like the nodes and filaments. A spa- 
tially adaptive kernel in real space may therefore be a 
better choice, but just like the fixed smoothing, it also 
introduces an unwanted numerical parameter into the 
analysis, namely the number of neighbors used for defin- 
ing the adaptive kernel. 

In this work, we therefore propose a novel method that 
provides for a genus analysis without any free parame- 
ter and with minimal smoothing, thereby allowing an 
extraction of the maximum amount of topological in- 
formation from the point set. The new approach is 
based on a three-dimensional density field reconstruction 
based on the Delaunay Tessellation Field Estimator tech- 
nique (jSchaap fc van de Wevgaertll2000t iPelupessv et al.l 
|2003() . This method allows the unambiguous definition 
of a continuous, piece- wise linear density field on a tetra- 
hedral mesh, constructed directly from the coordinates 
of the point set. This field is consistent, i.e. its volume 
integral reproduces the total mass. Furthermore, isoden- 
sity contours can be specified exactly in this field; they 
become polygonal surfaces for which the genus can also 
be calculated exactly. 

We apply both our new method as well as the tradi- 
tional approaches of fixed and adaptive smoothing to a 
number of different galaxy formation models constructed 
from the Millennium Simulation. We also compare genus 
results obtained for volume-limited galaxy samples con- 
structed from the SDSS data release 7 to matching mock 
surveys we created for the theoretical galaxy catalogues. 
This yields an important test of the consistency between 
the topology of the observed galaxy distribution and the 
theoretical models, as well as direct information about 
the improvement in discriminative power made possible 
by our new method for measuring the genus. 

This paper is organized as follows. In Sj2] we briefiy 
describe the genus statistics, and introduce the different 
density reconstruction methods we use in this study. In 
fJSl we describe the data that are used to perform our 
genus analysis: the galaxy catalogues constructed from 
the Millennium Simulation, the SDSS observations, and 
the mock galaxy redshift surveys we construct for com- 
parison. 21 presents our results for the genus curves of 
the observational and simulated data. Finally, we discuss 
our findings in Sj5] 

2. GENUS STATISTICS 

The ge nus is a me a sure o f the topology of a surface. 
Following iGott et al.l (jlQSGT ). we define the genus as 

G ~ number of holes 

— number of isolated regions, (1) 

which differs from the usual mathematical definition of 
the genus, 5math = G -I- 1, by a constant offset of 1. 
In our definition, an isolated sphere has a genus of —1, 
and a torus has a genus of 0. In the analysis of cosmic 
large-scale structure, it is customary to analyze the genus 
of isodensity surfaces as a function of density threshold 
for a density field that is suitably created from a tracer 



distribution of discrete points (e.g. galaxies). The actual 
measurement of the genus can be carried out by means 
of the Gauss-Bonnet theorem. 
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which relates the genus to the integral of the Gaussian 
curvature, k = l/(?'i?'2); over the surface. Here ri and 
r2 are the principal radii of curvature of the surface at 
the integration point. 

Interestingly, for a Gaussian random density field, the 
genus per unit v olume, q(v) = G(v)/y , is given by an 
analytic formula (|Hamilton et al.|[l986D . 
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depends only on the second moment 



J P{k)W{k)d^k 
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of the shape of the (smoothed) power spectrum P{k). 
Here W{k) denotes the Fourier transform of the smooth- 
ing kernel. The dimensionless parameter v — S-t/a en- 
codes the density threshold St of the isodensity surface 
that is considered, expressed in units of the rms dis- 
persion a of the field. The density field itself is ex- 
pressed as dimensionless density fluctuation field, S(x.) = 
P(x)/(p)-l. 

For general density fields, it is more useful to define v 
in terms of the fraction / of the volume above the density 
threshold, through the equation 



(6) 



where erfc(a;) = 2tt~^^^ e^* dt is the complementary 
error function. The v = contour corresponds to the 
median volume fraction contour (/ = 50%). We will 
employ this definition for labeling our density contours, 
but note that for a Gaussian field one then still has v — 
St /a. 

Deviations of the shape 5(1^) of a measured genus curve 
from the form of Eqn. ([3]) can be interpreted as a mea- 
sure of non-Gaussianity, or equivalently, of the cumula- 
tive impact of higher-order correlations. Our procedure 
for quantifying such deviations is to first find the best- 
fitting genus curve gmii^) expected for Gaussian random 
phases through a least-squares fit of Eqn. ^ to the mea- 
sured curve. This yields, in particular, the amplitude of 
the measured genus curve. 

Next, one can define different meta-statistics to quan- 
tify the differences of the measured genus curve relative 
to the Gaussian shape. For example , the s hift parameter 
Av has been defined by IPark et al.l (jl992l) as 
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which measures the horizontal shift of the central por- 
tion of the genus curve. The Gaussian random phase 
curve (Eqn. [S]) has Ai/ = 0. A negative value of t^v is 
sometimes called a "meatball shift" because it signifies a 
greater prominence of isolated over-densities in the den- 
sity field, while a void dominated field leads to Az/ > 0, 
called a "bubble shift" . 

Since the two negative extrema of the genus curve rep- 
resent a measure for the frequency of isolated over- and 
under-densities, the abundance of clusters and voids rel- 
ative to that expected from the best-fit Gaussian genus 
curve can be qua ntified by two parameters Ac and Ay 
(|Park et al.ll2005D . defined as 



ill g{v) dv 

An — 
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where the integration interval of Ac {Ay) is roughly cen- 
tered at ^' = -s/S and v — — -v/S, respectively, which are 
the locations of the minima in the best-fit Gaussian genus 
curve. At these points the sensitivity to the number of 
clusters and voids is greatest. Acy < 1 (^c,y > 1) 
means that fewer (more) isolated clusters or voids are 
observed than those expected from the best-fit Gaussian 
curve. 

In principle, additional genus meta-statistics are con- 
ceivable, and one may even use principal components 
analysis to determine the most sensitiv e measures for 
shape distortions of the genus curve (jSpringel et al.l 
[1998.) . Also, the amplitude of the genus curve relative 
to the genus of a field with the same power spectrum 
(jCanavezes et al.1 11998H can be used as a quantitative 
measure for the cumulative effect of higher order corre- 
lations. However, in this study, we will restrict ourselves 
to the simple genus statistics described above. 

2.1. Fixed Smoothing 

As is clear from the above, a given discrete point set 
first needs to be transformed to a continuous density field 
to allow the genus analysis. The simplest approach for 
this is to smooth the point set with a Gaussian kernel of 
the form 



W{r) = 
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where A is a fixed smoothing length. In practical terms, 
a Cartesian grid with spacing d can be used to repre- 
sent the density field, where the grid spacing d needs 
to be smaller than A by a factor of a few to provide 
adequate sampling. Sampling constraints also impose a 
lower bound on reasonable values for A, which should 
be chosen (considerably) larger than the mean spacing 
of points in the sample, otherwise the estimated density 
field will be dominated by noise. In practice, we choose 
A to be within 5-10 h'^Mpc. 

The actual measurement of the genus can then be car- 
ried out on the Cartesian grid in the following way. For 
a given prescribed density threshold, one considers the 



surface of all cell faces that lie between pairs of cells on 
opposite sides of the density threshold. This surface is 
then an approximation to the true isodensity contour of 
the underlying density field. In fact, the topology of the 
discretized surface, which is composed of squares, will be 
the same as that of the smooth true isodensity contour 
(modulo very small changes if the grid spacing is not fine 
enough) , except that all the curvature of the real surface 
is now compressed into the corners of the discretized con- 
tour. All the curvature is then represented by a set of 
Dirac delta functions at these corners, each giving an in- 
tegrated curvature equal to the angle deficit relative to 
360 degrees for the sum of the angles between the inci- 
dent edges. An application of the Gauss-Bonnet theorem 
to measure the genus of the surface then effectively re- 
duces to summing up the angle deficits at all vertices 
of the contour. This approach is exploited by the CON- 
TOUR algorithm (Weinbe rg, 1988i) . which we have reim- 
plemented in a C code for the analysis of the smoothed 
density fields in this paper. 

2.2. Adaptive Smoothing 

A serious disadvantage of the fixed smoothing tech- 
nique is that a single fixed smoothing length is not well 
adjusted to highly clustered particle distributions, like 
the ones we expect for galaxies of the cosmic web. If 
the smoothing is chosen sufficiently large to avoid under- 
sampling of the voids, the filaments and clusters will typ- 
ically be over-smoothed, limiting the amount of informa- 
tion that can be extracted. 

A better match to the spatially varying sampling den- 
sity can be obtained if an adaptive smoothing kernel 
is used, where the size of the kernel is determined by 
the local sampling density. In our study, we employ the 
adaptive kernel estimation techniques that is used in the 
well-known smoothed particle hydrodynamics (SPH) ap- 
proach. For the smoothing kernel, we adopt the common 
choice in SPH, a spherically sy mmetric spline of the form 
(|Monaghan fc Lattanziolll985D 
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W{r;h) = 
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(11) 

where the adaptive smoothing length /i(x) at each point 
X is defined as the distance to the iVgph-th nearest neigh- 
bor such that the number of particles inside the smooth- 
ing radius /i(x) is equal to a constant value iVsph- We 
then compute the density field as 



N 



p(x) = Yl ^^^(^^ - ^; ^W)' 



(12) 



where rrii is the mass of the i-th particle. This corre- 
sponds to the so-called gaifeer app roach to define a den- 
sity field C Hernquist &: Katzlll989l ). 

Again, we shall use a finely spaced Cartesian grid to 
represent the smoothed field. We note however that it is 
here considerably more difficult to make this mesh fine 
enough everywhere. In fact, for clustered distributions, 
some unwanted over-smoothing of the densest regions 
due to a too coarse mesh can usually not be avoided. 
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Fig. 1. — Sketch of the curvature calculation for isodensity surfaces constructed with the Delaunay Tessellation Field Estimator (DTFE) 
technique. The drawing on the left illustrates the three Delaunay tetrahedra that share a common edge DE in the tessellation, where for 
clarity the tetrahedra have been displaced from each other. The second drawing shows the real geometry of the involved subgroup of 5 
points (galaxies) in the point set. We assume that the density threshold pt of the contour lies between the densities po and estimated 
for the points D and E. We can then compute the point q in which the isodensity contour intersects the edge DE. Similarly, the isodensity 
contour intersects the outer edge DA-AE in point r, the lines DB-BE in point s, and DC-CE in point t (see third drawing from left). Finally, 
we can compute the angles o, fi and 7 enclosed by the edges of the isodensity contour that are incident on point q (rightmost drawing). 
This yields the angle deficit at this corner of the isodensity contour, providing a measurement of the integrated curvature. 



2.3. Genus calculation through a Delaunay tessellation 

The above hmitations of the fixed and adaptive 
smoothing techniques have motivated us to develop a 
more general approach for measuring the topology of a 
point set. The new method we propose below does not 
rely on any additional parameter, it is free of grid-spacing 
limitations, and at the same time extracts the maximum 
topological information from the point set. 

In mathematics and computational geometry, the De- 
launay tessellation for a set of points is the uniquely 
defined and volume-covering tessellation of mutually 
disjoint tetrahedra, in which no circumsphere of any 
tetrahedron con t ains one of the points in its interio r 
(|Delaunavl [1981 lOkabe et all ^MOl llllian et al. ll2008l ). 
Connecting the centers of the circumscribed spheres pro- 
duces the Voronoi tessellation, which is the topological 
dual of the Delaunay tessellation. 

There are different possibilities to use the Delaunay or 
Voronoi tessellations for density estimates. For example, 
one may use the Voronoi volumes V^°^ , and simply esti- 
mate the density within each Voronoi cell as p^°^ = . 
Another possibility lies in defining the density as 



where now Wi is the volume of the contiguous Delaunay 
region around point i, i.e. the sum Wi = Fj^'^' of the 
volumes of all Delaunay tetrahedra that have i as one of 
their vertices. Note that each Delaunay tetrahedron is 
contributing to the contiguous Delaunay region of four 
points, hence the multiplication by 4 in Eqn. ()13p . In 
the De launay Tessellation Field Estimator (DTFE) tech- 
nique (jSchaap fc van de Wevgaertll2000l : iPelupessv et al.l 
120031) , one estimates densities for each point in this way, 
and then linearly extends the density estimates at the 



corners of each tetrahedron to a volume filling density 
field. To this end, one simply uses tri-linear interpolation 
of the density values at the corners of each tetrahedron, 
in the basis of the three principal vectors that span the 
tetrahedron. This creates a continuous, piece-wise linear 
density field, in which the gradient within each tetrahe- 
dron is constant. Furthermore, it is easy to see that the 
volume integral of this field reproduces the sum of the 
particle masses exactly. 

The above DTFE method allows a unique, parameter- 
free construction of polygonal isodensity contours for 
a given set of points, without any restriction on dy- 
namic range, and without requiring a smoothing pro- 
cedure. Furthermore, we can readily measure the inte- 
grated Gaussian curvature (and hence the genus) of this 
surface by generalizing the technique employed for Carte- 
sian grids. This is because again all the curvature will 
be compressed into the corners/ vertices of the polygonal 
isodensity surface. These points all lie on edges of the 
Delaunay tessellation, because only there three or more 
tetrahedra with different gradients can meet. 

An example for the geometrical situation is sketched in 
Figure [TJ For every edge in the Delaunay tetrahedraliza- 
tion we can readily decide whether or not it is intersected 
by the isodensity contour corresponding to a prescribed 
density threshold pt- This happens when the densities 
of the two endpoints of the edge lie on different sides of 
the density threshold. Provided this is the case, we can 
straightforwardly find the intersection point on the edge 
which has density pt- Next, we have to determine the 
deficit angle around this point. To this end, we visit all 
tetrahedra that share the given edge and find the points 
on their outer edges that have density pt- Based on sim- 
ple geometrical operations we can then determine the 
total angle deficit around the intersection point, which 
corresponds to the integrated curvature at this corner 
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of the isodensity contour. Summing these angles over 
all Delaunay edges that have an intersection point then 
yields the genus of the surface. 

We can also obtain an exact measurement of the en- 
closed volume fraction within the isodensity contour. To 
this end, we simply loop over all tetrahedra and check 
whether any of their points lie below or above the density 
threshold. If all four points lie below pt, the full volume 
of the tetrahedron is counted in the enclosed volume frac- 
tion. If one corner lies below pt, then the intersections on 
the three incident edges can be calculated, yielding to- 
gether with the low corner a tetrahedral volume that is 
counted towards the volume fraction. Similarly, if three 
points are below pt, then the volume complement for 
the corner that lies above the density threshold can be 
readily computed. Only in the case where two points lie 
below and two above pt, a bit of more work is required. 
Here the volume below the threshold can be obtained as 
a composite of three suitably defined tetrahedra. Finally, 
if all four points lie above the threshold, there is evidently 
no contribution to the enclosed volume fraction. 

In our data analysis below, we will try out this method 
for the first time in the topological analysis of cosmic 
large-scale structure. To construct the Delaunay mesh, 
we e mploy the tesse llation engine of the parallel AREPO 
code (jSpringell 120091 ). 

Before we apply this new approach to galaxy data, it 
is useful to illustrate its response to clustering with the 
help of an N-body simulation. To this end, we have run a 
small coUisionless N-body simulation with 128"^ particles 
in a periodic box of size L = 50 /i~^Mpc on a side, using 
the cosmological parameters of the Millennium Simula- 
tion. In Figure [2l we show eight genus curves measured 
at different times between the starting redshift z — 127 
and the final epoch of z = 0. First, during the mildly 
non-linear evolution, the genus amplitud e declines, due 
to th e development of phase correlations (jSpringel et al.l 
Il998f ). After z ~ 7, strong non- linear evolution sets in, 
and the number density of virialized halos rapidly in- 
creases. As a result, the genus curve develops a "meatball 
shift" , and an ever large asymmetry between the minima 
on the v < and v > sides. Interestingly, the max- 
imum of the genus curve first begins to increase in this 
non-linear phase, but then declines again at late times. 
The latter is probably related to the slowly declining to- 
tal number of halos at late times, as they aggregate into 
ever larger structures, and the thinning-out of the cos- 
mic web. For comparison, we also include in Figure[2]the 
genus curve for a random distribution of an equal num- 
ber of points, shown as a dashed line, which yields a very 
different shape compared to the N-body simulation at all 
times. Overall, it is therefore clear that the shape of the 
genus curve measured with the DTFE method encodes 
a wealth of interesting information about the clustering 
pattern which is not readily accessible by other statistical 
measures. 

3. DATA 

3.1. Galaxy catalogues based on the Millennium 
Simulation 

The simulation analyzed in this study is the so called 
"Millennium Simulation" (MS) which evolved N = 
2160^ ~ 1.0078 X 10^" particles from redshift z = 127 to 



the present in a cubic box of length Lbox = 500/i~^Mpc 
on each side (jSpringel et al.l [20051 ). The cosmological 
parameters in the Millennium Simulation are flm = 
+ = 0.25, f^b = 0.045, h = 0.73, Qa = 0.75, 
n = 1, and erg = 0.9, where the Hubble constant is given 
as Hq = 100 ft. kms^^ Mpc^^, erg is the rms linear mass 
fluctuation in a sphere of radius 8 h~^Mpc at z = and 
n is the spectral index of the primordial power spectrum. 
The simulation was carried out wi th the massively paral- 
lel GADGET-2 code |SpriiigcT'2005) . Gravitational forces 
were computed with the TrcePM method, where long- 
range forces are calculated with a classical particle-mesh 
method while short-range f orces are determined with a 
hierarchical tree approach (|Barnes fc HutlHOSl . The 
Plummer equivalent gravitational softening length was 
5 ft^^kpc, which can be taken as the spatial resolution of 
the simulation. 

Dark matter h alos and subhalos were identified 
with the FOF (iDavis et al.l Il985t) and SUBFIND 
(jSpringel et al.ll200lD algorithms, respectively. Based on 
the halos and subhalos at all output times of the simu- 
lation, detailed merging history trees were constructed, 
which form the basic input required by subsequently ap- 
plied semi-analytic models of galaxy formation. 

The first galaxy catalogue we consider is constructed 
based on the conditio nal luminos it y fun ction (CLF) ap- 
proach developed by l Yang et all (j2003f ). Halos of dif- 
ferent masses are populated with galaxi es according to 
the b est fit CLF parameters listed in Cac ciato et al.l 
(j2009f l. properly converted to the MS cosmology. We as- 
sign each galaxy an r-band absolute magnitude "-^M^ — 
5 log ft, which is K + E corrected to redshift z = 0.1 
(Bla nton et al.ll2003f l. Note that by construction, the 
galaxy catalogue has a luminosity function that is in very 
good agreement with the SDSS observations. From this 
catalogue, we select a sample of galaxies in the abso- 
lute magnitude range -21.7 < °-^Mr - 5 log ft < -20.1 
for further analysis. While the choice of this abso- 
lute magnitude range is somewhat arbitrary, it yields a 
good compromise of a fairly large volume and a high 
number density of galaxies as covered by the SDSS ob- 
servations (see Fig. [3]). The total number of galaxies 
and mean separation in this sample are 665, 914 and 
5.73 ft~-^Mpc, respectively. In order to assess statisti- 
cal uncertainties due to cosmic variance, we divide the 
galaxy catalogue corresponding to the full simulation 
box into 8 subsamples with ibox — 250ft~^Mpc each, 
and consider the scatter among the results for the in- 
dividual sub-samples. To investigate the dependence of 
the genus statistics on galaxy absolute magnitudes, we 
also construct two different samples with the absolute 



magnitude ranges —19.8 < 



0.1 



and -18.8 < 



0.1 



M, 



M^ 
5 log h < 



5 log ft < -19.2 
-18.3, respectively. 



which have a similar number of galaxies as in the case of 
-21.7 < o ^M^ - 51ogft < -20.1. 

The second galaxy catalogue we study is taken from 
iCroton et al.l ([2006) □. This semi-analytic galaxy forma- 
tion model includes a total of about 9 million galaxies 
at z = 0. After converting the absolute magnitudes of 
galaxies hy a K + E correction to redshift z — 0.1, we 
also construct from this catalogue three different samples 

^ The semi-analytic galaxy catalogue is publicly available at 
http: / /www. mpa-garching.mpg.de/galform/agnpaper 
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Fig. 2. — Genus curves calculated with the DTFE technique at different times for a dark matter N-body simulation of the ACDM 
cosmology. The panel on the left illustrates the substantial modification of the shape of the genus curve as a function of time, first through 
mildly non-linear evolution, and finally through the strong clustering in the non-linear regime. The dotted line shows the curve for a random 
distribution of an equal number of points (128"^) in the box of size V = {50h~^Mpc) . The panel on the right repeats the low-redshift 
measurements, for clarity. In this strongly clustered regime, the genus shows a marked "meatball" shift, and a very strong asymmetry. 
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Fig. 3. — SDSS galaxies in redshift-absolute magnitude space. 
The rectangle indicates the boundary definition we adopted to de- 
fine volume limited subsamples. 

in the same absolute magnitude ranges that we described 
above. 

Finally, the third the oretical galaxy cat alogue used 
in this paper is that of iBower et al.l ()2006D . The cat- 
alogue consists of a total of about 24 million galax- 
ies. Again, after K + E correcting the absolute mag- 
nitudes and converting them to redshift z — 0.1, we 
construct three galaxy samples in the absolute mag- 
nitude ranges -21.7 < °-^Mr - 51og/i < -20.1, 
-19.8 < "-^Mr - 51og/i< -19.2 and -18.8 < °-^Mr - 
51og/i < —18.3, respectively, as before. Note that as 



discussed in iLiu et"al1 (|2010l ). galaxies with given lumi- 
nosity / stellar masses have quite diffe rent distribut i ons in 
halo s of different masses when the IBower et al.l ()2006l ) 
and De Lucia fc BlaizotI ()2007[ ) m odels are c ompared, 
whereas the latter is very similar to lCroton et al. (,2006) • 
Note that in each of the absolute magnitude ranges, 
there are slight differences in the numbers of galaxies 
among the three catalogues. Since the genus curve can 
quite strongly d epend on the to tal number of galaxy 
in consideration ([Protogeros fc W einberg 1997), we ran- 
domly down-sample the ga laxies where needed according 
to the IBower et al.l ()2006D catalogue within the magni- 
tude range -21.7 < °-'^Mr - Slog /i < -20.1 (a few 
percent less than others), so that all our samples have 
exactly the same number of galaxies. 

3.2. Sloan Digital Sky Survey 

The Sloan Digital Sky Survey (SDSSl^SeEaDEOOO), 

one of the most influential galaxy redshift surveys to 
date, is a multi-filter imaging and spectroscopic sur- 
vey to explore the large-scale distribution of galaxies 
and quasars. Here we make use of the New York Uni- 
versity Value-Adde d Galaxy Catalogue (NYU-VAGC; 
iBlanton et al."20051. which is based on the SDSS Data 
Release 7 (Abazaiian et al. 2009). DR7 marks the com- 
pletion of the survey phase known as SDSS-II. It fea- 
tures a spectroscopy that is now complete over a large 
contiguous area of the Northern Galactic cap, closing 
the gap which was present in previous data releases. 
The continuity over this large area is a great advance- 
ment and critical to the statistics of large-scale struc- 
ture. From the NYU-VAGC, we select all galaxies in the 
Main Galaxy Sample with an extinction corrected appar- 
ent magnitude brighter than r = 17.72, with redshifts in 
the range 0.01 < z < 0.20 and with a redshift complete- 
ness Cz > 0.7. The extracted SDSS galaxy catalogue 
contains a total number of 639, 555 galaxies. Note that 
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in this catalogue, a very small fraction of galaxies have 
redshifts that are borrowed from the Korea Institute for 
Advance d Study (KIAS) Value-Added Galaxy Cat alog 
(VAGC) (|Park fc Choil[2(M [Ghclenil[2^ 

Since the complicated survey geometry may potentially 
impact the genus measurements significantly, we select 
only galaxies with right ascension a and declination 5 in 
the ranges 120° < a < 240° and 10° < (5 < 55°, which 
results in a loss of ^ 44% of the galaxies, of which how- 
ever nearly half are not in the coherent region. With this 
selection, the remaining SDSS survey volume will suffer 
less from edge effects, and we have a precise understand- 
ing about where the edges lie in our genus calculations. 
We note that for our new DTFE method we always need 
to put the data points into a large enclosing box for which 
formally periodic boundaries are imposed in order to fa- 
cilitate the construction of the Delaunay tessellation. We 
have checked that filling the residual volume of this box 
with a sparse grid of background points or leaving it 
empty has no significant influence on our results, con- 
firming that the edge effects are sufficiently small (see 
also the tests in Fig. [8] below). 

Furthermore, we construct a volume-limited galaxy 
sample with absolute mag nitudes -21.7 < ^-^M^ - 
h\ogh < —20.1. As shown in the redshift-absolute mag- 
nitude diagram (Fig. [3]), this corresponds to the redshift 
range 0.0197 < z < 0.1187 when the apparent magnitude 
cut (10.2 < nir < 17.7) is applied. The final volume- 
limited sample we use based on the SDSS observations 
contains 92, 662 galaxies. 

3.3. Mock galaxy redshift surveys 

The end product of the CLF approach and the SAMs 
considered here is a large sample of galaxies distributed 
over the dark matter halos in the cubic simulation box 
of the Millennium Simulation. One approach would be 
to compare these galaxy samples directly with the SDSS 
data. However, this ignores the fact that the latter is 
affected by observational selection effects, especially the 
survey geometry and redshift distortion effects. To make 
an "apples-to-apples" comparison with the SDSS data, 
which is essential especially for the genus statistics, we 
construct mock galaxy redshift surveys (MGRSs) for the 
three galaxy catalogues we discussed in Section [3?T] 

Our const ruction of the MG RS here is similar to tha t 
described in lYang et all (|2004l ) (see also lLi et al] 120071 ). 
First, we stack 3x3x3 replicas of the simulation box and 
place a virtual observer at the center of the stacked boxes. 
Next, we assign each galaxy (a, 6) coordinates and re- 
move the ones that are outside the mocked SDSS sur- 
vey region. For each model galaxy in the survey region, 
we compute its redshift (which includes the cosmologi- 
cal redshift due to the universal expansion, the peculiar 
velocity, and a 35kms~^ Gaussian line-of-sight velocity 
dispersion to mimic the redshift errors in the data), its 
r-band apparent magnitude (based on the r-band lumi- 
nosity of the galaxy). We eliminate galaxies that are 
fainter than the SDSS apparent magnitude limit, and in- 
corporate the position dependent incompleteness by ran- 
domly eliminating galaxies according to the completeness 
factors obtained from the survey masks provided by the 
NYU- VAGC. To have a measure of the error on the genus 
statistics, we construct 6 MGRSs for the CLF galaxy cat- 
alogue by rotating the simulation boxes. 



Finally, similar to the SDSS observational data, we 
construct volume-limited galaxy samples with absolute 
magnitudes -21.7 < °-^Mr - 51og/i < -20.1 and a red- 
shift range of 0.0197 < z < 0.1187. Again, we downsam- 
ple the resulting galaxy catalogues according to the one 
with least number of galaxies if needed so that they have 
the same number of galaxies for the genus measurements. 

4. GENUS STATISTICS OF COSMIC 
LARGE-SCALE STRUCTURE 

Reconstructing a density field from a set of irregu- 
larly sampled points is a key step in measuring the genus 
curve. In this section, we investigate the effects of dif- 
ferent reconstruction methods for the density field on 
the genus and its related statistics. Since we have two 
types of galaxy catalogues, with and without SDSS sur- 
vey selection effects, we present our measurements in two 
subsections. 

4.1. Results for galaxies in real space 

We first carry out genus measurements for the three 
galaxy catalogues in the full Millennium Simulation box, 
where periodical boundary conditions apply. Note that 
we are here measuring the signal in real space, i.e., the 
distribution of galaxies is not affected by the redshift dis- 
tortion effects that are present in the SDSS observations. 

4.1.1. Fixed Smoothing Genus Curve 

The mean galaxy separation in all of our galaxy 
samples based on the Millennium Simulation is about 
5.73/i~^Mpc, therefore we can reasonably safely apply a 
Gaussian smoothing length of 6 h~^Mpc. If the smooth- 
ing length is smaller than l/\/2 times the mean galaxy 
separation, the genus curve tends to show a "meatball 
shift" because the algorithm pick s out indiv idual galaxies 
as isolated high density regions ^Gott et al. 1987, 1989). 

For each of 8 subsamples of the three full galaxy cat- 
alogues, we first compute the density field using the 
cloud-in-cell (CIC) assignment scheme. The galaxies 
are assigned onto 64'^ grids covering the box of ibox — 
250ft.^^Mpc, where the pixel size of 3.91ft-^^Mpc is 
deemed adequate for the smoothing length of 6 h^^Mpc. 
Next, we smooth the galaxy density field by convolv- 
ing it with a Gaussian window function (Eq. llOp at 
A = 6ft.~^Mpc. Finally, for each sample we compute 
the genus at 100 values of ly, where i' is defined in terms 
of the contour's enclosed volume fraction (Eqn.[6]). 

In Fig. m we show the resulting genus curves for 
the CIC method and a Gaussian smoothing with A = 
6h~^Mpc, for different galaxy catalogues: based on 
the CLF approach (upper row panels), the SAM of 
'Croton et al. (2006) (middle row panels), and the SAM 
of iBower et al. (2006) (lower row panels), respectively. 
The panels in the left to right columns give the results 
for galaxies with absolute magnitude within —21.7 < 
o iMr-51og/i < -20.1, -19.8 < o iM^-51og/i < -19.2, 
and -18.8 < "-^Mr - 5 log ft. < -18.3, respectively. In 
each panel, the mean genus curve of the 8 subsamples 
is depicted by the solid line, while the shaded areas in- 
dicates the Ifj scatter calculated from the 8 subsamples. 
Compared to the Gaussian fit, which is shown as the dot- 
ted line, the genus measurements indicate that after CIC 
assignment and Gaussian smoothing with A = 6 /i^^Mpc, 
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Fig. 4. — Genus curves measured for galaxy samples from three different galaxy catalogues constructed for the Millennium Simulation, 
from the upper to lower rows as labeled, using the fixed smoothing technique. The shaded region indicates the standard error of the 8 
subsamples in each model, while the solid line is their mean. The dotted line is the best-fit genus curve for a Gaussian field. The density 
field has here first been constructed on a fine grid using CIC assignment and was then smoothed with a Gaussian of smoothing length 
A = 6/j~^Mpc. Results shown in the left, middle, and right hand columns are for galaxies in different absolute magnitude bins, as labeled, 
where Mj, = ^'^Mr — 5 log h is the r-band absolute magnitude, K + E corrected to redshift z = 0.1. In each panel, we include measurements 
for the amplitude A, the horizontal shift Au, and the abundance diagnostics for clusters and void, Ac and Ay. In each case we cite the 
mean of our 8 measurements and their standard deviations. 
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Fig. 5. — Similar to Fig.|4] but here we compare results for galaxies with —21.7 < "'^Mr — Slogh < —20.1 using three different methods 
for density reconstruction. In the left panel, the density field was constructed using the CIC method with A = 6/i~^Mpc. In the middle 
panel, the density field was constructed using the SPH adaptive kernel method with N^pi-^ = 64 neighbors. The right panel is based on the 
DTFE method and represents a direct genus measurement of the Delaunay tetrahedrafization. In each panel, we show results for galaxy 
subsamples con stru cted using the CLF approach (solid line w ith squares and err or bars), and obtained for the semi-analytic models of 
ICroton et al-j ||2006I) (dotted line with circles and error bars) and lBower et all II2006I) (dashed line with triangles and error bars), respectively. 
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Fig. 6. — Comparison of the genus statistics in different absolute magnitude bins based on different density reconstruction methods and 
for different galaxy catalogues, as indicated in each panel. 



the galaxy density fields are consistent with a Gaussian 

distribution at slightly more than la leve l. 

One step further, we follow iParkeFaLl (|l99l to mea- 
sure the shift parameter At/ of the genus curves, as listed 
in each panel of Fig. 2] In the bright absolute mag- 
nitude bin -21.7 < ^-^Mr - Slog /i < -20.1, we find 
that the CLF and Croton galaxy catalogues have pos- 
itive \v > at about a 2a significance, which means 
that void structu res dominate the galaxy distribution. 
iGott et al.l (|2008l l report a value of Ai/ = 0.010 ± 0.023 
for mock samples from Millennium Simulation, which is 
in agreement with our measurements for galaxies in dif- 



ferent absolute magnitude bins and in different galaxy 
catalogues. 

Following lPark et al.l (|2005l) , we also measured the void 
multiplicity parameter Ay and the cluster multiplicity 
parameter Ac, respectively. As can be seen in Fig. |4l 
the void multiplicity parameter Ay is lower than 1 in all 
the measurements, especially for galaxies in the bright 
absolute magnitude bin -21.7 < "-^M^-Slog/i < -20.1, 
which implies that voids are very empty and coalesce into 
fewer large voids than would be expected for a Gaus- 
sian field. The value Ay slightly rises for fainter galaxies 
in all the three galaxy catalogues. On the other hand. 
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however, the cluster multiphcity parameter Ac is quite 
different among the three galaxy catalogues. It is con- 
sistently below unity in the CLF galaxy catalogue, in- 
dicating that there are fewer independent isolated high 
den sity regions t h an fo r a Gaussian random field. In 
the ICroton et al.l (|2006f ) galaxy catalogue in the other 
hand, there is a trend that the values of Ac rise and 
are consistent with unity only for fainter galaxies. Over- 
all, however we fin d that the amplitude of Ac for the 
I Bower et al.l (|2006( ) galaxy catalogue is consistent with 
unity. Also, Ay shows smaller deviations than found in 
the other two catalogues, suggesting that the galaxy dis- 
tribution in Bower et al. indeed shows subtle differences 
from the other two. 

4.1.2. Genus Curve for an adaptive smoothing kernel 

We next turn to an analysis of the genus based on 
adaptively constructed density fields, considering first 
the 'classic' adaptive kernel estimation technique and 
in the next subsection our new tessellation method that 
works directly with the point set. 

For each of the 8 subsamples of three galaxy cata- 
logues, we compute the density field on a regular 64^ 
grid using the SPH method with a neighbor number of 
^sph = 64 galaxies. Then we calculate the genus curve 
at 100 values of i^. In the middle panel of Fig. [SI we 
show the genus curves based on the SPH method for 
our three theoretical galaxy catalogues in the magnitude 
range -21.7 < ^-^M^ -5\ogh< -20.1. 

Note first that the number of structural elements re- 
solved with the SPH adaptive smoothing is much larger 
than the one accessible with fixed smoothing. Compared 
with the A — 6ft.^^Mpc fixed smoothing results, which 
are shown in the left panel of Fig. [5l we find that the 
-^sph = 64 adaptive smoothing scheme reaches a genus 
density which is approximately three times larger, which 
is due to the considerably better adaptive resolution of 
the corresponding isodensity surfaces. However, similar 
to the fixed smoothing method, the results of the genus 
statistics for the three catalogues are clearly well consis- 
tent with each other, lacking the power of discriminating 
between the different assumptions about the galaxy for- 
mation physics made in the models. 

4.1.3. Genus Gurve for the DTFE technique 

Finally, we consider our new DTFE method for density 
reconstruction and measure the genus statistics directly 
on the Delaunay tessellation defined by the galaxy coor- 
dinates. This method is free of any smoothing parameter, 
and should be able to resolve the largest number of struc- 
tural elements. The results are shown in the right panel 
of Fig. [3 again for three different galaxy catalogues in the 
magnitude range -21.7 < "-^Mr - 51og/i < -20.1. The 
squares with error bars, connected with a solid line, show 
the results for our primary CLF catalogue. The dotted 
and dashed lines give the corresponding results for the 
other two catalogues. Overall, the genus curve shows 
a much larger amplitude than can be resolved with the 
adaptive smoothing, which is a direct consequence of the 
larger resolving power of this method. Interestingly, we 
find that there is a significant genus differences between 
the three galaxy cata logues, espec i ally b etween the first 
two and the SAM of i Bower etall ()2006D . This directly 
demonstrates the improved discriminative power made 



possible by the DTFE genus approach, which here can 
pick up the subtle differences introduced in the galaxy 
distribution due to different assumptions made in the 
theoretical galaxy formation modeling. 

Apart from the different implementation of galaxy for- 
mation processes that cause the different halo occupa- 
tion number distribution for galaxies of given luminos- 
ity, e.g., with -21.7 < "-^Mr - Slog /i < -20.1, we 
consider galaxies of different luminosity ranges. As ex- 
plicitly modeled in the CLF approach introduced by 
Yang et al. (2003), galaxies with different luminosity 
may be hosted by halos of different masses, which in 
turn can have quite different genus behavior. Accord- 
ing to the halo mass distributio n of galaxies with given 
luminosity shown in Fig. 3 of lYang et"al] ()2009D , we 
roughly expect that galaxies with absolute magnitude 
in theVange -21.7 < "-^M^ - Slog /i < -20.1 should 
be hosted by halos of mass ~ 10^^ h~^y[Q if they are 
central galaxies, and in larger halos if they are satellite 
galaxies. Galaxies with -19.8 < ^-^Mr - Slog/i< -19.2 
and -18.8 < "-^M^ - Slog ft, < -18.3 should be hosted 
in halos of mass ~ IO^-H-^Mq and - lO^-^/i-iMg, 
respectively, as central galaxies and in larger halos as 
satellite galaxies. We show and compare in each panel of 
Fig. [S] the genus statistics of galaxies in these three dif- 
ferent magnitude ranges. Results shown in the different 
panels correspond to different galaxy catalogues (left to 
right columns) and different genus measurement meth- 
ods (upper to lower rows). As can be seen, only in the 
three bottom panels based on the DTFE method can we 
clearly distinguish the difference among the genus curves 
for galaxies in different magnitude ranges, at much bet- 
ter than a la level. N ote also, in the DTFE results, 
the IBower et al.l ()2006l ) galaxy catalogue reveals signif- 
icantly smaller differences between galaxies within the 
different absolute magnitude ranges than the other cata- 
logues. This behavior can be well understood if we check 
the halo mass distribution of thes e galaxies: for a given 
luminosity, the IBower et al.l ()2006D model sh ows compar- 
atively large halo mass scatter. And thus in IBower et al.l 
(2006) there is a larger overlap of the halo masses in the 
three different magnitude ranges (see also Fig. 4 in Liu 
et al. 2010). 

4.2. Results for galaxies in redshift space 

Having measured the genus statistics in real space 
using different methods for three different galaxy cat- 
alogues and for galaxies in three different luminosity 
ranges, we now turn to an analysis where observational 
selection effects are included. This also allows us to 
make a direct comparison with the SDSS observations 
in an "apples-to-apples" fashion. We note however that 
we here compare only to theoretical predictions obtained 
for the ACDM cosmology simulated in the Millennium 
Simulation; in the future, it will be very interesting to 
also carry out such a comparison for different cosmo- 
logical models. Also we remark that although we have 
demonstrated in the previous subsections that the genus 
measured with the DTFE method can in principle dis- 
tinguish topological differences between different galaxy 
catalogues constructed for the same simulation model, 
and galaxies within different absolute magnitude ranges, 
we lack the theoretical prediction/judgment of which of 
these models is ultimately the most accurate. 
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Fig. 7. — Comparison of the genus statistics between the SDSS and mock redshift surveys that take into account various observational 
effects, especially the redshift distortion effect, using different density reconstruction methods from left to right, as indicated in the panels. 
The error bars are calculated from 6 mock redshift surveys (writh different pointings) based on the galaxy catalogue constructed using the 
conditional luminosity function. 




Fig. 8. — Comparison of the genus statistics for the DTFE 
method measured from the full simulation box (solid line), from a 
smaller box of equal volume to the SDSS (dotted line) , and from the 
SDSS geometry (dashed line v^ith error bars) for the CLF galaxy 
catalogue based on the Millennium Simulation. The results shown 
in this plot are all measured in real space. The error bars for the 
SDSS geometry results are again calculated from 6 mock redshift 
surveys (with different pointings) based on the CLF catalogue, but 
here in real space. The shaded area is the standard error of 8 
subsamples. 

As an illustration, we here use the SDSS observation to 
judge to what extent the three galaxy catalogues we dis- 
cussed in this paper agree with the observations. Note 
that in SDSS, there are a number of important survey 
selection effects: flux limit, sky mask, redshift distor- 
tion, etc. To have a fair comparison between the ob- 
servations and the galaxies in the three theoretical cata- 
logues, we have generated appropriate mock galaxy red- 
shift surveys (MGRSs). We then compare the genus 
statistics measured for the volume-limited samples with 
-21.7 < °-iMr - 5 log /i < -20.1 extracted from the 



Fig. 9. — Comparison of the genus statistics for the DTFE 
method in real (solid line) and redshift (dotted line) space for 
MGRSs based on the CLF galaxy catalogue. The results in real 
and redshift spaces are the same as those shown in Fig. |8] and the 
right panel of Fig. [7] respectively. 

SDSS observations and for the MGRSs corresponding to 
our three theoretical galaxy catalogues. The main differ- 
ences between these measurements and the ones carried 
out earlier in cubic boxes are: (1) a light cone geometry 
without periodic boundary conditions is used, and (2) 
the analysis is in redshift space. 

In Fig. [71 we show the genus curves of SDSS and the 
MGRSs generated from the three different galaxy cata- 
logues. Results shown in the left, middle and right panels 
are for genus statistics measured using the A = 6 h-^Mpc 
fixed smoothing, the SPH A'sph = 64 adaptive smooth- 
ing, and the DTFE method, respectively. The error bars 
on top of the CLF results are obtained from la variances 
of 6 MGRSs generated from the CLF galaxy catalogue 
by adopting different pointings. As one can see from 
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the plot, the first two genus measurement methods indi- 
cate that the three galaxy catalogues are consistent with 
the SPSS observations at about a l-2a level, with the 
iBower et~a l. (2006) model doing slightly wor se. However, 
the DTFE method clearly shows that the IBower et alJ 
(poos' ) galaxy catalogue is actually inconsistent with the 
SPSS observatio ns at a high significance level (see also 
IChoi et al.ll20T0l) . A gain, this is a strong demonstration 
of the improved discriminative power of the PTFE genus 
measurement technique even when used in redshift space. 

Finally, we examine the systematic effects that our 
SPSS survey selections may induce. For illustration pur- 
poses, we focus only on the PTFE technique and check 
the impacts of the light cone geometry and of redshift 
space distortions. We first investigate the light cone ge- 
ometry effect (according to the SPSS mask) on the shape 
and amplitude of the genus curve. In Fig. [8l we show 
three PTFE genus curves based on the CLE galaxy cat- 
alogue, scaled to the same volume: (i) computed for the 
cubic box of Lbox = 500 /i~^Mpc, (ii) for small boxes of 
equal volume as the SPSS, and (iii) for MGRSs selected 
with exactly the same SPSS geometry. Note that all 
the results shown in this plot are measured in real space. 
The small and large box results are in overall good agree- 
ment at better than l-tr level. However, the results for 
the SPSS geometry do suffer somewhat from the edge 
effects, biasing the genus curve to a lower (by about 2-cr) 
maximum value. 

Next we check the impact of redshift space distortions. 
In Fig. [9l we compare the PTFE genus curves for the 
MGRSs based on the CLE galaxy catalogue, measured 
separately in real and redshift spaces. Both the max- 
imum and minimum values of the genus curves in real 
space are more prominent than those in redshift space, 
and their shapes are also slightly different, emphasiz- 
ing the need of making "apples-to-apples" comparisons 
with the SPSS observations when genus measurements 
are considered. With further tests we have confirmed 
that similar differences also exist in the fixed and adap- 
tive smoothing genus measurements. 

In this paper, we have focused on introducing the 
new PTFE methodology and showing its potential. We 
therefore restricted our analysis to a comparison of the 
genus statistics of SPSS observations with galaxy mod- 
els generated for a single simulation model, the Millen- 
nium Simulation, using only ~ L* galaxies. We defer 
a more detailed comparison with the SPSS observations 
to a forthcoming paper, where also galaxies generated in 
simulations with different cosmological parameters, and 
galaxies in different luminosity bins, etc., are studied. 
This will also further clarify the ability of the PTFE 
genus technique to provide constraints on cosmological 
and galaxy formation parameters. 

5. CONCLUSIONS 

In this study, we introduced a new method for mea- 
suring the genus statistics of isodensity surfaces of the 
galaxy distribution, based on the Pelaunay tessellation 
field estimation for constructing a continuous and piece- 
wise linear galaxy density field. Compared to traditional 
Gaussian smoothing and adaptive SPH smoothing, this 
technique does not require any free parameter, and it al- 
lows the extraction of the largest amount of topological 
information from the point set. 



To demonstrate the abilities of the new method for 
measuring the topology of the large-scale structure com- 
pared to the traditional methods, we have carried out 
genus measurements using galaxies both in real space 
and in redshift space. In real space, we make use of 
three galaxy catalogues constructed for the same under- 
lying Millennium Simulation: one based on the CLE ap- 
proach and the other two generated with semi-analytical 
galaxy formation models. In redshift space, we make use 
of the volume-limited samples extracted from the SPSS 
observational data and the MGRSs generated using the 
three galaxy catalogues. Three types of genus measure- 
ment methods are introduced and applied to those sam- 
ples: (i) CIC assignment with A = 6 h~^Mpc fixed Gaus- 
sian smoothing; (ii) adaptive SPH-like smoothing with a 
neighbor number of Nsph = 64 galaxies; and (iii) Pelau- 
nay tessellation field estimation. 

Based on our various comparisons, we can summarize 
our findings as follows. 

• The traditional genus measurement method based 
on a fixed smoothing can not really distinguish the 
topology of the different galaxy formation models 
considered here. Reassuringly, they are all found 
to be consistent with the SPSS observations. If 
adaptive smoothing is used instead, more topologi- 
cal information can be recovered, but only marginal 
differences between the models at a l-2tT level can 
be detected. 

• The PTFE method has significantly enhanced 
genus measurement power: its larger amplitude 
in G{i') reflects its ability to extract a maximum 
amount of topological information from the galaxy 
density field. 

• Most importantly, the PTFE method can dis- 
tinguish the topology of different galaxy forma- 
tion models and of galaxies in different luminosity 
ranges at very significant confidence levels, both in 
real and in redshift space. 

• Comparing with the SPSS observational data using 
MGRSs that take into account various survey selec- 
tion effects, we find that t he semi-analytical g alaxy 
catalogue constructed bv IBower et al.l ()2006[ ) devi- 
ates from the observations significantly. 

Our results have clearly demonstrated the power of our 
new PTFE method in performing a topological analysis 
both in real and in redshift space. The present work is the 
first study in a series of papers where we want to exploit 
the full potential of the new approach for constraining 
cosmological parameters and the galaxy formation pro- 
cess. It will also be interesting to see how sensitive the 
PTFE method is to traces of non-Gaussianity in the ini- 
tial conditions, or to non-standard dark energy models. 
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